Ricardo Baeza - Yates

نویسندگان

Ricardo Baeza-Yates

Yoelle Maarek

Thomas Roelleke

Arjen P. de Vries

چکیده

The XML Fragment model offers a convenient formalism for querying XML collections "by example", that is, by formulating the query as a piece of XML that expresses the user's needs. This allows relevant results to be returned either as full documents or as XML Fragments, using a simple extension of the vector space model for ranking. In this work, we investigate extending this model to text analytics applications where semantic tags (e.g., names, entities, relations, etc.) are automatically generated to annotate the underlying text. Each type of tag can easily be represented as an XML element, but the spans of these tags often cross over each other, which, of course, is not allowed by the XML DOM structure. We discuss how our original XML Fragments model can be extended to query annotated documents with possibly overlapping annotations and illustrate our approach with examples of queries over annotated documents generated in the context of IBM’s Unstructured Information Management Architecture (UIMA) framework.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Handling Proximity for Text Problems

متن کامل

Diseñemos Todo de Nuevo: Reflexiones sobre la Computación y su Enseñanza (Invited paper)

What and how to teach are the fundamental questions in our activities as lecturers. This paper presents my view on these questions related to computer science, and illustrates a critical and constructive analysis and its implications in the education, including two partial answers to these questions. REVISTA COLOMBIANA DE COMPUTACIÓN Volumen 1, número 1 Págs. 7-28 Ricardo Baeza Yates 2

متن کامل

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2004

Ricardo Baeza - Yates

نویسندگان

چکیده

منابع مشابه

Handling Proximity for Text Problems

Diseñemos Todo de Nuevo: Reflexiones sobre la Computación y su Enseñanza (Invited paper)

A Comparison of Open Source Search Engines

Relating Web Structure, User Search Behavior

Modern Information Retrieval - the concepts and technology behind search, Second edition

Balancing Volume, Quality and Freshness in Web Crawling

عنوان ژورنال:

اشتراک گذاری